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1  Statement  of  the  Problem  Studied 

Isaac  Levi,  a  philosopher  at  Coliunbia  University,  has  developed  an  epistemolog¬ 
ical  model  of  the  process  by  which  a  rational  decision  agent  may  acquire  new  in¬ 
formation.  Levi's  model  addresses  the  goals  and  values  of  the  rational  agent,  the 
agent's  willingness  to  risk  error,  and  the  representation  of  incomplete  and  con¬ 
tradictory  evidence.  His  model  is  based  on  the  representation  of  knowledge  and 
value  systems  using  convex  sets  of  probability  and  informational  distributions  and 
on  the  use  of  an  epistemic  utility  function  that  balances  the  risk  of  error  against  the 
desire  for  new  information;  his  model  includes  classical  Bayesian  decision  theory 
as  a  special  case. 

A  potential  drawback  to  practical  application  of  Levis  theory  is  the  computa¬ 
tional  complexity  associated  with  computing  convex  sets  of  probability  distribu¬ 
tions.  Bayesian  networks,  developed  by  Judea  Pearl,  address  similar  complexity 
problems  in  Bayesian  decision  theory  and  Dempster/Shafer  evidential  reasoning. 

The  problem  studied  in  this  work  was  the  use  of  Bayesian  networks  to  im¬ 
plement  Levi's  theory.  Initially,  the  emphasis  was  both  to  develop  the  theoreti¬ 
cal  formdation  necessary  to  apply  Bayesian  network  structures  to  Levi's  decision 
theory  and  to  develop  practical  network  updating  algorithms.  As  the  work  pro¬ 
gressed  and  issues  associated  with  application  of  Levi's  theory  were  settled,  the 
emphasis  shifted  to  methods  of  implementing  Bayesian  networks  that  contained 
both  continuous-valued  and  discrete-valued  nodes.  The  issue  of  continuous  and 
mixed  continuous-discrete  networks  is  generally  pertinent  to  decision  problems, 
using  either  Levi's  theory  or  classical  Bayesian  decision  theory;  the  method  of  im¬ 
plementing  continuous  and  continuous-discrete  networks  developed  in  the  course 
of  this  project  is  applicable  to  both  decision  approaches. 

2  Summary  of  Results 

The  results  of  this  project  can  be  roughly  categorized  as  falling  into  one  of  three 
areas: 

•  Implementation  of  Levi's  theory  using  Bayesian  networks, 

•  Development  of  Bayesian  network  updating  algorithms  for  continuous  val¬ 
ued  network  nodes,  and 

•  Application  of  continuous  Bayesian  networks  to  stochastic  filtering  prob¬ 
lems. 

In  the  following  sections,  we  describe  each  of  these  results  in  more  detail. 
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2. 1  Bayesian  Networks  and  Levi's  Theory 


A  key  point  in  Levi's  development  of  his  theory  is  that  sets  of  distributions  used 
in  the  decision  process  must  be  convex.  Thus,  we  investigated  whether  the  com¬ 
putations  performed  in  Bayesian  networks  preserve  the  convexity  of  sets  of  distri¬ 
butions.  We  found  that,  in  general,  these  computations  do  not  preserve  convexity; 
however,  there  are  significant  special  cases  in  which  convexity  is  preserved. 

In  light  of  these  results,  we  addressed  the  question  of  whether  Levi's  theory 
could  be  implemented  using  Bayesian  networks.  To  this  end,  we  showed  that  ap¬ 
plication  of  the  decision  rule  that  follows  from  Levi's  model  (called  the  rule  of 
expected  utility)  to  non-convex  sets  of  distributions  gives  the  same  decision  as  is 
obtained  using  the  convex  closure  of  these  sets.  Thus,  the  preservation  of  convex¬ 
ity  by  a  network  structure  is  not  necessary  when  applying  Levi's  rule  of  expected 
utility.  An  important  consequence  of  this  result  is  that,  to  implement  Levi's  deci¬ 
sion  rule,  a  Bayesian  network  need  compute  only  extreme  distributions  and  not  all 
distributions  in  a  set. 

2.2  Continuous  and  Continuous/ Discrete  Networks 

Bayesian  networks  that  contain  continuous  valued  nodes  (i.e.  continuous  valued 
random  variables)  are  applicable  to  a  much  broader  range  of  problems  than  net¬ 
works  containing  only  discrete  nodes.  Thus,  we  investigated  methods  by  which 
Bayesian  networks  can  be  extended  to  continuous  nodes.  In  particular,  we  inves¬ 
tigated  approximations  of  density  functions  associated  with  continuous  random 
variables.  The  approach  showing  the  most  promise  out  of  the  several  considered 
is  to  approximate  density  functions  by  weighted  sums  of  Gaussian  densities,  and 
this  is  the  approach  that  we  adopted  for  further  study. 

Using  Gaussian  sum  approximations,  we  derived  network  updating  algorithms 
for  nodes  whose  values  depend  on  their  parent  nodes  in  both  linear  and  non-linear 
relationships.  Implementation  of  these  update  equations  requires  that  the  network 
compute  weighting  coefficients,  means,  and  variances  of  Gaussian  distributions; 
these  computations  are  simply  performed  using  the  network  structure.  This  ap¬ 
proximation  approach  also  allows  the  formation  of  hybrid  networks  with  both 
continuous  and  discrete  nodes  without  imposing  restrictions  on  the  relationships 
allowable  between  continuous  and  discrete  variables,  as  is  the  case  with  previous 
work  on  hybrid  networks.  The  use  of  Gaussian  sum  approximations  for  continu¬ 
ous  nodes  is  applicable  in  any  context  in  which  a  Bayesian  network  may  be  applied 
and,  in  particular,  is  not  restricted  to  networks  used  to  implement  Levi's  theory. 

We  also  addressed  the  difficult  problem  of  computing  good  Gaussian  sum  ap¬ 
proximations  to  densities.  We  investigated  several  different  possible  techniques, 
including  gradient  descent,  annealing  algorithms,  and  expectation-maximization 
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algorithms.  We  obtained  the  best  approximatioris  using  a  gradient  descent  method 
which  incrementally  increases  the  number  of  Gaussians  to  refine  the  approxima¬ 
tion  until  a  desired  accuracy  is  obtained.  This  approach  was  also  used  to  re¬ 
approximate  densities  after  network  computations  to  reduce  the  number  of  Gaus¬ 
sians  in  approximations. 

2.3  Stochastic  Filtering 

We  investigated  the  the  application  of  Bayesian  networks  to  stochastic  filtering 
problems;  these  are  problems  for  which  the  Kalman  filter  might  be  considered  as  a 
potential  solution  and  include  target  recognition  and  tracking,  sensor  fusion,  and 
terrain  aided  navigation.  The  application  of  Bayesian  networks  to  stochastic  filter¬ 
ing  facilitates  the  fusion  of  classical  stochastic  filtering  techniques  with  imcertain 
reasoning  techniques,  allowing  symbolic  as  well  as  quantitative  information  to  be 
used  in  file  filtering  process.  A  stochastic  filtering  problem  can  often  be  formu¬ 
lated  as  estimating  the  state  of  a  Hidden  Markov  Model  (HMM).  Since  a  HMM 
is  a  particular  form  of  a  Bayesian  network,  our  previously  developed  continuous 
Bayesian  networks  can  be  applied  to  this  problem;  however,  the  number  of  nodes 
in  file  network  increases  linearly  with  the  number  of  observations  processed  (i.e. 
the  network  size  increases  as  a  function  of  time).  In  our  work,we  addressed  this 
problem  through  a  filter  control  system  that  stores  densities  for  use  in  subsequent 
computations;  this  allow  the  Bayesian  network  size  to  remain  fixed. 

We  applied  this  filtering  approach  to  an  angle  only  target  tracking  problem.  In 
this  problem,  a  moving  target  is  tracked  using  a  sequence  of  bearings-only  mea¬ 
surements.  This  problem  is  a  particularly  difficult  filtering  problem  because  the 
observations  do  not  provide  enough  information  to  completely  locate  the  target 
and  because  the  observations  are  a  non-linear  function  of  the  target  position;  thus, 
classical  approaches  such  as  fiie  extended  Kalman  filter  cannot  be  successfully  ap¬ 
plied.  We  developed  a  Bayesian  network  and  corresponding  filter  control  system 
for  this  problem,  and,  using  simulation,  compared  its  performance  (in  terms  of 
position  estimate  accuracy  and  filter  convergence)  to  the  extended  Kalman  filter. 
Our  tracker  was  significantly  more  robust  than  the  extended  Kalman  filter,  and  our 
tracker  provided  much  more  accurate  estimates  of  the  target  position. 
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